Evaluation of Disk Allocation Methods for Parallelizing Spatial Queries on Grid Files‡

نویسندگان

  • Mark Coyle
  • Shashi
چکیده

Spatial Database Systems are characterized by large amounts of geometric and geographic data. Query response times in these systems are crucial, since these systems are often used interactively for decision support systems. The Grid file[1] is a well-known spatial access method that has great potential for parallelism, which reduces the response time of spatial queries for time-critical on-line applications of spatial databases. An important problem in parallelizing a grid file is the choice of a disk allocation method for mapping the data-blocks to multiple disks in the parallel secondary storage. This paper explores the disk allocation methods used to allocate the data pages of a grid file among a set of disks that can be accessed in parallel. In this environment, given N disks, a perfect allocation will speed up the processing of each query by a factor of N. The paper shows that no disk allocation is perfect for the set of all orthogonal range queries, even on uniformly distributed read-only data. We then introduce two families of allocation methods, namely the Linear allocation method and the Lattice allocation method, which are perfect for a large collection of interesting path queries (rows and columns) and range queries (small rectangles), on an interesting set of data distributions. We address the issues in extending disk allocation methods to general data distributions that are updated randomly. Finally, we provide experimental results on both the performance of the proposed methods and other well known disk allocation methods on different query sets, data distributions and data set sizes.

برای دانلود رایگان متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

Disk Allocation Methods for Parallelizing Grid Files

The grid file [1] is a well known access method for multi-dimensional and spatial data. The response time needed to process path and range queries on the grid file access method can be improved significantly by distributing the data pages over multiple disks. This paper explores the disk allocation methods used to allocate the data pages of grid file among a set of disks, which can be accessed ...

متن کامل

Perfect Allocation Methods for Spatial Queries in Parallel Disk Systems

A disk-allocation method assigns a disk-id to each unit of spatial data. Allocating spatial data over multiple disks to distribute the I/O cost of query processing uniformly over available disks can tremendously speed up the processing. An allocation method is called perfect for a query set if it balances the I/O load on each disk in processing any query in a query set. Some of the interesting ...

متن کامل

Study of Scalable Declustering Algorithms for Parallel Grid Files

Efficient storage and retrieval of large multidimensional datasets is an important concern for large-scale scientific computations such as long-running time-dependent simulations which periodically generate snapshots of the state. The main challenge for efficiently handling such datasets is to minimize response time for multidimensional range queries. The grid file is one of the well known acce...

متن کامل

An Optimal Disk Allocation Strategy for Partial Match Queries on Non-Uniform Cartesian Product Files

The disk allocation problem addresses the issue of how to distribute a file on to several disks to maximize the concurrent disk accesses in response to a partial match query. In the past this problem has been studied for binary as well as for p-ary cartesian product files. In this paper, we propose a disk allocation strategy for non-uniform cartesian product files by a coding theoretic approach...

متن کامل

Optimal Partitioning for Efficient I/O in Spatial Databases

It is desirable to design partitioning techniques that minimize the I/O time incurred during query execution in spatial databases. In this paper, we explore optimal partitioning techniques for spatial data for diierent types of queries, and develop multi-disk allocation techniques that maximize the degree of I/O parallelism obtained during the retrieval. We show that hexagonal partitioning has ...

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

عنوان ژورنال:

دوره   شماره 

صفحات  -

تاریخ انتشار 1995